On Maximally Recoverable Local Reconstruction Codes
نویسندگان
چکیده
In recent years the explosion in the volumes of data being stored online has resulted in distributed storage systems transitioning to erasure coding based schemes. Local Reconstruction Codes (LRCs) have emerged as the codes of choice for these applications. An (n, r ,h,a,q)-LRC is a q-ary code, where encoding is as a two stage process. In the first stage, h redundant parity symbols are generated from k data symbols. In the second stage, the k+h symbols are partitioned into sets of size r −a and each set is extended with a redundant symbols using an MDS code to form a local group. Local groups ensure that when at most a coordinates are erased, any missing coordinate can be recovered by accessing at most r − a symbols. Also, if a larger number of coordinates is erased, the missing symbols can be recovered by potentially accessing all remaining symbols. An (n, r ,h,a,q)-LRC code as above is Maximally Recoverable (MR), if it corrects all erasure paerns which are information theoretically correctable given the presence of local groups. Obtaining MR LRCs over finite fields ofminimal size is important in practice and has been the goal of a line of work in coding theory. In this work we make progress towards this goal. In particular: – We show that when a and h are constant and r may grow, for every maximally recoverable LRC, q > Ωa,h ( n · rmin{a,h−2} ) . Prior to our work, there was no super-linear lower bound known on the field size of MR LRCs for any seing of parameters. – We obtain a family of MR (n, r ,h = 2,a,q)-LRCs, where q = O(n) for all seings of parameters. Prior to our work the best constructions required q to be quadratic in n for some regimes. – We obtain a family of MR (n, r ,h = 3,a,q)-LRCs, where q = O(n) for all seings of parameters. Prior to our work the best constructions required q to be nΘ(a) for some regimes. – Our results in the first two bullets above suggest the seing of r = 3,a = 1,h = 3 as the first seing where existence of MR LRCs over fields of near linear size is an open question. We resolve this question in the positive by developing a new approach to LRC constructions based on elliptic curves and arithmetic progression free sets. ∗Research supported by NSF CAREER award 1451191 and NSF grant CCF-1523816. Most of this work was done when the author was visiting Microso Research. †Research supported in part by NSF grant CCF-1563742. Most of this work was done during a visit by the author to Microso Research, Redmond. e work was also partly done when the author was visiting the School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore.
منابع مشابه
Maximally Recoverable Codes for Grid-like Topologies
The explosion in the volumes of data being stored online has resulted in distributed storage systems transitioning to erasure coding based schemes. Yet, the codes being deployed in practice are fairly short. In this work, we address what we view as the main coding theoretic barrier to deploying longer codes in storage: at large lengths, failures are not independent and correlated failures are i...
متن کاملOn Maximally Recoverable Codes for Product Topologies
Given a topology of local parity-check constraints, a maximally recoverable code (MRC) can correct all erasure patterns that are information-theoretically correctable. In a grid-like topology, there are a local constraints in every column forming a column code, b local constraints in every row forming a row code, and h global constraints in an (m × n) grid of codeword. Recently, Gopalan et al. ...
متن کاملCommunication Cost for Updating Linear Functions when Message Updates are Sparse: Connections to Maximally Recoverable Codes
We consider a communication problem in which an update of the source message needs to be conveyed to one or more distant receivers that are interested in maintaining specific linear functions of the source message. The setting is one in which the updates are sparse in nature, and where neither the source nor the receiver(s) is aware of the exact difference vector, but only know the amount of sp...
متن کاملCommunication Cost for Updating Functions when Message Updates are Sparse: Connections to Maximally Recoverable Codes
We consider a communication problem in which an update of the source message needs to be conveyed to one or more distant receivers that are interested in maintaining specific linear functions of the source message. The setting is one in which the updates are sparse in nature, and where neither the source nor the receiver(s) is aware of the exact difference vector, but only know the amount of sp...
متن کاملLabeling the complete bipartite graph with no zero cycles
Maximally recoverable codes are codes designed for distributed storage which combine quick recovery from single node failure and optimal recovery from catastrophic failure. Gopalan et al [SODA 2017] studied the alphabet size needed for such codes in grid topologies and gave a combinatorial characterization for it. Consider a labeling of the edges of the complete bipartite graph Kn,n with labels...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Electronic Colloquium on Computational Complexity (ECCC)
دوره 24 شماره
صفحات -
تاریخ انتشار 2017